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Abstract 

This article reports on second language perception of non-native contrasts. The 
study specifically tests the perceptual assimilation model (PAM) by examining 
American learners’ ability to discriminate Arabic contrasts. Twenty two native 
American speakers enrolled in a university level Arabic language program took part 
in a forced choice AXB discrimination task. Results of the study provide partial evi¬ 
dence for PAM. Only two-category contrasts followed straightforwardly from PAM ; 
discrimination results of category-goodness difference and both uncategorizable 
contrasts yielded partial support, while results of uncategorized versus categorized 
contrast discrimination provided counter-evidence to PAM. 

Keywords : discrimination of sound contrasts, Arabic consonant contrasts, 
perceptual assimilation model 


It has long been recognized that second language (L2) speakers perceive 
and produce non-native sounds under the influence of their first language (LI) 
sound systems. Polivanov (1931) and Trubetzkoy (1939) were among the earliest 
to view the native language phonological system as a filter through which L2 
sounds are perceived and produced. Lado (1957) formalized this approach in his 
contrastive analysis hypothesis, which predicts L2 learning difficulty only in sounds 
and structures different from those in the LI. The poor discrimination of English IV 
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and/r/by Japanese speakers of English (e.g., Best & Strange, 1992; Yamada, 1995) 
is a classic example of how learners show perceptual difficulty with new contrasts. 

However, it soon became clear that a simple contrastive comparison be¬ 
tween LI and L2 sounds is inadequate (Eckman, 1987; Gass & Selinker, 2001) 
as it falls short of explaining why some L1-L2 differences are easy to learn and 
why, on the other hand, some L1-L2 similarities still pose a great amount of 
difficulty for L2 learners (Towell & Hawkins, 1994). Flege (1987) showed, for 
example, how English speakers of French produce Ll-similar French /u/ with 
second formant values significantly different from native French norms while 
their production of Ll-different French /y/ is native-like (similar findings in 
Flege, 1995; Flege, Bohn, & Jang, 1997). In perceiving non-native segments, 
English listeners demonstrate excellent discrimination of novel Zulu click con¬ 
trasts which do not exist phonemically or phonetically in English (Best, 
McRoberts, & Sithole, 1988), but poor discrimination of Zulu plosive versus 
implosive bilabial stops /b/-/6/, even though /b/ exists in English. 

Consequently, several models of speech perception and production 
have been formulated to better predict areas of difficulty in learning a foreign 
language sound system. Flege's (1987, 1995) psychoacoustic speech learning 
model claims that the ability to perceive and produce L2 sounds in a native¬ 
like manner is contingent upon the establishment of L2-separate abstract 
phonetic categories, 1 the forming of which is regulated by the similarity rela¬ 
tionship between LI and L2 sounds: "The greater the perceived distance of an 
L2 sound from its closest LI sound, the more likely it is that a separate catego¬ 
ry will be established for the L2 sound" (Flege, 1987, p. 264). The speech learn¬ 
ing model predicts, contra a classical contrastive analysis, that similar non¬ 
native sounds will be problematic for learners, whereas new sounds, different 
from any existing LI sound, will be easily acquired. The perceptual assimilation 
model (PAM) advanced by Best et al. (1988) and Best (1995) claims that con¬ 
trast discrimination varies gradiently as a function of the similarity between 
native and non-native sounds. 

In this study I test the predictions of PAM as they relate to the percep¬ 
tion of Arabic consonants by American English (AE) speakers. The paper is or¬ 
ganized as follows. First, I lay out some of the basic tenets and claims of the 
model. After discussing the relationship between Arabic and English pho¬ 
nemes, the predictions of PAM with regard to 9 Arabic consonant contrasts 
are outlined and their testing in a perceptual experiment involving AE learners 


1 Flege (1995, p. 239) defines phonetic categories as "language specific aspects of speech 
sounds [which] are specified in long term memory representations." 
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of Arabic is reported. An appraisal of the results in light of the model follows. 
The paper concludes with a summary of the main findings. 

The Perceptual Assimilation Model 

The perceptual assimilation model assumes that listeners have direct 
access to relevant information in the speech signal without the need for in¬ 
termediate representations. It maintains that auditorily listeners perceive 
speech sounds in terms of the gestures necessary to articulate them (Best, 
1995; Fowler, 1989). The model has undergone a number of revisions since it 
was first developed to account for English speakers' rather unexpected excel¬ 
lent discrimination of non-native Zulu clicks in Best et al. (1988). M ost of the 
discussion that ensues is mainly based on Best (1995). 

The perceptual assimilation model seeks to explain the gradient success 
listeners demonstrate in perceiving and discriminating non-native segments 
and contrasts. According to PAM, the ability to discriminate various non-native 
contrasts follows from implicit or explicit assimilation of each contrasting 
segment to a native category. A regulating factor in determining assimilability 
is the degree of phonetic closeness or discrepancy which native and non¬ 
native sounds share, as stated in Best (1995): 

The fundamental premise of the perceptual assimilation model of cross-language 
speech perception is that non-native segments, nonetheless, tend to be perceived 
according to their similarities to, and discrepancies from, the native segmental con¬ 
stellations that are in closest proximity to them in native phonological space, (p. 139) 

The degree of similarity between native and non-native phonemes is defined 
by "the spatial proximity of constriction locations and active articulators and 
by similarities in constriction degree and gestural phasing" (p. 194). A non¬ 
native phoneme is more likely to be assimilated to a native one when it is per¬ 
ceived as a good exemplar of its native equivalent. A non-native phoneme can 
be assimilated as (a) an existing native speech sound perceived as being iden¬ 
tical, acceptable or deviant exemplar of the native category, (b) a speech 
sound within the phonological space, but not representative of any particular 
native category, or can be heard as (c) a non-speech sound and therefore out¬ 
side of the native phonological space. 

Given that contrast discriminability in PAM is predictable from the as¬ 
similation of each segment in the contrast, the different combinations of (a)- 
(c) result in the following pairwise assimilation types, each with its predicted 
level of discriminatory accuracy (Best, 1995, p. 195): 
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1. Two-category assimilation (TC type): Each non-native segment is as¬ 
similated to a different native category, and discrimination is expected 
to be excellent. An example of this type is the Tigrinya ejective con¬ 
trast between the voiceless alveolar /t’/ and bilabial /pV, assimilated to 
the English alveolar-bilabial contrast/t/-/p/, respectively (Best, 1993). 

2. Category-goodness difference (CG type): Both non-native sounds are 
assimilated to the same native category, but they differ in discrepancy 
from the native "ideal" (e.g., one is acceptable, the other is deviant). 
Discrimination is expected to be moderate to very good, depending on 
the magnitude of difference in category goodness for each of the non¬ 
native sounds. The voiceless ejective and non-ejective velars /k’/-/k/ in 
Zulu are likely to be treated as voiceless velar /k/ in English, with Zulu 
/k/ as the good exemplar and ejective /kV as the deviant (Best, 1994). 

3. Single-category assimilation (SC type): Both non-native sounds are assimi¬ 
lated to the same native category, but are equally discrepant from the na¬ 
tive "ideal;" that is, both are equally acceptable or both equally deviant. 
Discrimination is expected to be poor (although it may be somewhat above 
chance level). Best (1994) gives the Thompson Salish contrast in ejective ve¬ 
lar /k7 and uvular /qV as a SC assimilation type where both sounds are likely 
to be perceived as deviant exemplars of prototypical English velar/k/. 

4. Both uncategorizable (UU type): Both non-native sounds fall within 
phonetic space, but outside of any particular native category, and can 
vary in their discriminability as uncategorizable speech sounds. Dis¬ 
crimination is expected to range from poor to very good, depending 
upon their proximity to each other and to native categories within na¬ 
tive phonological space. The well-known difficulty in distinguishing the 
English /l/-/r/ contrast by Japanese speakers reported in Best and 
Strange (1992) and Yamada and Tohkura (1992) can be an example of 
this type; neither liquid is assimilated to a good Japanese equivalent. 

5. Uncategorized versus categorized (UC type): One non-native sound is as¬ 
similated to a native category, and the other falls in phonetic space, outside 
native categories. Discrimination is expected to be very good. The English 
/r/-/w/ distinction for Japanese listeners fits this type where, unlike English 
/w/, which isassimilated as Japanese /w/, English /r/ is not assimilable to any 
Japanese category (Guion, Flege, Yamada, & Pruitt, 2000). 

6. Nonassimilable (NA type): Both non-native categories fall outside of 
speech domain being heard as non-speech sounds, and the pair can 
vary in their discriminability as non-speech sounds; discrimination is 
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expected to be good to very good. English speakers' discrimination of 
the Zulu clicks which, for an English listener, do not resemble any 
speech sound, falls into this category of assimilation (Best et al., 1988). 

The assimilation types just listed describe how non-native contrasts can vary in 
their discriminability as a function of the gestural (dis)similarity each member 
of the contrast bears to a native segment. Interference from the native phono¬ 
logical system can lead to a clear perceptual benefit in distinguishing contrasts 
when the contrast involves two separate, clearly defined categories in the 
native language (TC). It can cause, however, a perceptual detriment when the 
contrast elements correspond to one native phoneme (SC). Or it can be irrele¬ 
vant as in the NA type. The assimilation types TC, CG, and SC refer to non¬ 
native contrasts which can be assimilated to the native phonological space, 
and therefore follow from degree-of-similarity type (a). UU is concerned with 
contrasts that fall outside of the native domain but are still speech-like as 
stated in (b). UC is characterized by both (a) and (b), whereas a contrast of the 
NA type is solely based on (c). The different predictions made by PAM on the 
three assimilation types that occur within the native phonological domain sug¬ 
gest the following hierarchy of difficulty in the discrimination of non-native 
contrasts, where TC is easiest: TC >CG >SC. 

Several hypotheses of PAM have been tested. In one of the earliest stud¬ 
ies that have led to the development of PAM, Best et al. (1988) examined the 
discrimination of Zulu (a Bantu language) place and voicing click contrasts by 
native speakers of American English. Despite the fact that clicks do not exist 
phonemically or phonetically in English, an AXB task showed that English listen¬ 
ers had little difficulty in discriminating the Zulu click consonant contrasts, even 
when amplitude differences (a crucial acoustic cue for clicks) were leveled in a 
subsequent experiment. In both experiments adult English native speakers' suc¬ 
cess on the discrimination tasks, which amounted to 80% correctness in the 
natural condition and 78% in the modified one, is predictable as an NA assimila¬ 
tion type. Best et al. (1988) argue that since clicks are gesturally very distant 
from, and therefore cannot assimilate to, any English phoneme as their articula¬ 
tion involves ingressive suction followed by loud release, they were most likely 
perceived by English listeners as non-speech sounds that do not belong to any 
native category, in which case listeners were more reliant on the auditory and 
phonetic properties of clicks, which make them highly discriminable. 

M ore recently, Best, M cRoberts, and Goodell (2001) evaluated the predic¬ 
tions of PAM on the TC, CG and SC assimilation types by examining English listen¬ 
ers' perception of Zulu and Tigrinya consonant contrasts. For native English listen¬ 
ers the contrasts between the voiceless and voiced Zulu lateral fricatives/I/-/I 3 /as 
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well as the ejective bilabial and alveolar stops in Tigrinya /p’/-/tV were expected to 
fit a TC assimilation type, 2 in which case Zulu fricatives would be equated with 
English voiceless versus voiced apical fricatives and Tigrinya /p’/-/tV with non- 
ejective bilabial versus alveolar stops. Moreover, English listeners were predicted 
to show a CG type in their assimilation of Zulu voiceless aspirated and ejective 
velar stops /k h /-/k’/ to American English Ik/. Finally, a discrimination pattern con¬ 
sistent with SC was believed to emerge in the Zulu contrast between plosive and 
implosive voiced bilabial stops /b/-/6/, given that both sounds would most likely be 
perceived as English /b/. An AXB test showed all English listeners had more diffi¬ 
culty discriminating the Zulu aspirated-ejective contrast in velar stops than the 
voiceless-voiced contrast in lateral fricatives (89.4% vs. 95% correct discrimina¬ 
tion). Even worse, though not generalizeable to all subjects, was their ability to 
distinguish the plosive-implosive contrast in bilabial stops (65.9%). 

The advantage of Best et al.'s study is that it clearly shows how the PAM 
predictions for these three assimilation types were upheld within the same 
language and by the same group of listeners, eliminating the possibility that 
the observed pattern of discrimination was the result of different languages 
and/or subject populations. A second experiment using the same AXB para¬ 
digm revealed that English listeners' discrimination of the Tigrinya bilabial- 
alveolar place contrast was consistent with the Zulu voicing contrast: both TC 
types. In general, English listeners' performance on the non-native Zulu and 
Tigrinya contrasts followed straightforwardly from the patterns outlined in 1- 
3, along with their predicted discrimination levels, confirming the TC >CG >SC 
discriminability ranking suggested by PAM. 

Further evidence for PAM is found in Guion et al.'s (2000) assessment of 
the UU and UC types. Guion et al. hypothesize that the distinction between Eng¬ 
lish /r/ and IV would fall into the UU type (both uncategorizable) for Japanese 
listeners, given that the only liquid Japanese has is the alveolar retroflex tap 
[t], which is phonetically deviant from its English equivalent. Further, two more 
contrasts, English /r/-/w/ and /s/-/0/, were evaluated and assumed to follow the UC 
assimilation type based on the fact that Japanese /w/ and Is/ are good exemplars 
of their English equivalents. The findings of Guion et al. are generally in line with 
PAM; Japanese listeners were more successful in distinguishing English /r/ from 
/w/ in contrast to the poorly discriminated liquids. An exception was listeners' 


2 In determining what sound contrasts belong to which assimilation types, gestural similar¬ 
ities and differences among English, Zulu, and Tigrinya sounds were discussed in Best et al. 
(2001, pp. 778-779). 
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discrimination of /s/-/0/, which contrary to a UCtype prediction, was rather diffi¬ 
cult, arguably due to the phonetic proximity of the segments in the pair. 

An earlier study by Best and Strange (1992) involving Japanese speakers 
yields results that are different from those of Guion et al. (2000), yet con¬ 
sistent with the PAM predictions. In Best and Strange (1992), Japanese listen¬ 
ers perceived and discriminated English /r/-/l/ and /r/-/w/ as SC and CG con¬ 
trasts, respectively. However, it should be noted that in both studies the con¬ 
trast between /r/ and /w/, classified as either UC or CG, was consistently dif¬ 
ferentiated better than the liquid contrast, classified as UU or SC. Other cross¬ 
language speech perception studies also report findings that are supportive of 
PAM (e.g., Tigrinya ejective contrasts for English speakers in Best, 1990; Eng¬ 
lish /r/-/w for French speakers in Halle, Best, & Levitt, 1999; English final ob¬ 
struent voicing contrasts for Malay speakers in Pilus, 2002; and Hindi dental- 
retroflex for English speakers in Polka, 1991). 

Given the need for further evaluation of PAM on novel languages as as¬ 
serted by Best (1995, p. 198), the present study tests AE listeners' ability to 
successfully discriminate a number of diverse contrasts in L2 Arabic, a lan¬ 
guage not yet examined in light of PAM. 

Language Background 

Consonant Inventories of Arabic and English 

Modern Standard Arabic (or Arabic) is the official language of instruc¬ 
tion, media and science in the Arab world, and the target language for the 
majority of L2 Arabic learners in the US and elsewhere. For comparison pur¬ 
poses, the consonant inventories of Arabic and English are both outlined. 3 

The phonemic inventory of Arabic consonants is represented by a wide 
range of sounds, among which are the universally less frequent uvular, pharyngeal 
and glottal places of articulation. There are 28 consonant phonemes as shown in 
Table 1, adopted from Al-Ani (1970) and Ingham (1971) with modification. Eng¬ 
lish, on the other hand, has 24 consonant phonemes as illustrated in Table 2. 


3 The vowels of Modern Standard Arabic include /a/, /i/, /u/ and their long counterparts 
/a:/, /i:/ and /u:/. 


267 




Mahmoud S. Al Mahmoud 


Table 1 Consonant inventory of Arabic 



Stop 

Affricate 

Nasal 

Fricative 

Trill Approximant 

Bilabial 


b 


m 


w 

Labiodental 





f 


Interdental 





0 5 







6 


Dental 

t 

d 






t 

d 





Alveolar 




n 

s z 

r I 

Palatoalveolar 



d3 


s 

J 


Palatal 






j 

Velar 

k 






Uvular 

q 




X Y 


Pharyngeal 





h ? 


Glottal 


? 



h 



Note. Underlining represents emphatic consonants. Phonemes to the left in pairs are voiceless. 


Table 2 Consonant inventory of English (Ladefoged, 2001) 



Stop 

Affricate 

Nasal 

Fricative 

Central 

approximant 

Lateral 

approximant 

Bilabial 

P 

b 


m 



w 


Labiodental 





f 

V 



Dental 





0 

6 



Alveolar 

t 

d 


n 

s 

z 


1 

Palatoalveolar 



ff d3 


1 

3 



Retroflex 







i 


Palatal 







j 


Velar 

k 

9 


r) 





Glottal 





h 





Note. Phonemes to the left in pairs are voiceless. 


Comparing the inventories of the two languages, there are a number of 
phonemes that are found in Arabic only but not in AE. These include the 
(pharyngealized) emphatics /a/, /s/, /d/,/t/; the uvulars /q/, ty], /y/; the 
pharyngeals /h/, AY and the glottal /?/. The phoneme /r/ is realized as the retro¬ 
flex approximant [j] by many American speakers of English (Ladefoged, 2001), 
but it is an alveolar (or sometimes dental) trill [r] for the majority of Arabic 
speakers (Al-Ani, 1970; Amayreh, 2003). M uch controversy surrounds the phar¬ 
yngeal sound AY. Although traditionally described as a voiced fricative, Al-Ani 
(1970, p. 62) has concluded on acoustical basis that it is best characterized as a 
voiceless stop visible on the spectrogram in the form of a 40-50 ms burst ac¬ 
companied by noise. Similarly, Thelwall (1990) argues that the pharyngeal pho- 
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neme AV is realized in many Arabic dialects as a pharyngealized glottal stop. 
However, Ladefoged and Maddieson (1996) consider/?/ an epiglottal fricative. 

The rest of the Arabic consonant phonemes exist in English although 
their phonetic realizations in each language may differ slightly. Because 
/t/, /d/, /0/, /6/, /t/, 16/, Ihl, Ikl, /q/, Ikl, h/J, and /y/ are the only subset of Ara¬ 
bic consonants examined in this study, the discussion below will be limited to 
how each of these consonants is realized phonetically, if any, in each lan¬ 
guage; their similarity to and discrepancies from English should aid in predict¬ 
ing how native speakers of English would assimilate them, when possible, to 
their native phonological categories. 

Similarity Between Arabic and English Consonants 

In determining if and how Arabic consonant contrasts would be per¬ 
ceived and assimilated to the English sound system the first step is to examine 
the native phonetic realization of each individual phoneme in each language, 
before even discussing what contrasts are to be tested. While this study 
acknowledges the similarity measure set forth by Best (1995), in which gestur¬ 
al specifications necessary for the proper articulation of each segment have to 
be assessed for the language pair in question, a detailed gestural account of 
Arabic consonants is not possible for a small scale study. The comparison be¬ 
low is based on the articulatory and acoustic measurements of Arabic conso¬ 
nants provided by Al-Ani (1970) as well as on other descriptive work in Arabic 
linguistics (e.g., Al-Karouri, 1996; Bateson, 1967). For English most of the dis¬ 
cussion is based on Ladefoged (2001, 2005). 

A first glance at the Arabic consonants examined in this study renders 
the following unequivocal classification, according to their phonemic status in 
the English inventory: 

• those that exist as separate phonemes in English: /t/, /d/, /©/, /5/ Ikl, /h/; 

• those that do not exist in English as phonemes: /t/, /a/, /q/, h/J, /y/ and Ikl. 
Among those that exist in English, Arabic /t/ and /d/ are often produced with 
the tip of the tongue touching the posterior part of the front teeth (i.e., the 
dental place of articulation; Al-Ani, 1970; Bateson, 1967), but according to 
Ladefoged (2001) their articulation in English involves a fully alveolar gesture. 
The stop /t/ is aspirated initially in both languages, but only in Arabic is it often 
released in final position. When aspirated the burst intensity for /t/ is concen¬ 
trated in the 3 kHz range for Arabic (Al-Ani, 1970, p. 45), but in English the 
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energy is in the higher 3-5 kHz (Ladefoged, 2005, p. 53) or 3.9 kHz 
(O'Shagaussey, 2000, p. 66) frequencies. 

In Arabic the fricatives /0/ and /Q/ are always described as being inter¬ 
dental sounds, produced with the tip of the tongue placed between the upper 
and lower front teeth (Al-Ani, 1970; Al-Karouri, 1996; Bateson, 1967). In Eng¬ 
lish, however, many native speakers produce them with the tongue tip placed 
at the back of the upper front incisors (dental), although for some American 
speakers they are produced interdentally (Ladefoged, 2005, pp. 119-120). For 
the velar IkJ, the most common realization in both languages is the voiceless 
aspirated stop made by bringing the tongue into contact with the velum alt¬ 
hough a palatalized or fronted variant occurs in Arabic when next to high front 
vowels. The energy associated with the velar aspiration has its intensity in the 
2-3 kHz range for both languages (Al-Ani, 1970, p. 32; Ladefoged, 2005, p. 52). 
The stop /k/ is often released initially in English, but in Arabic, like other plo¬ 
sives, it is released both in initial and final positions. The voiceless fricative N 
is characterized by random noise that is "caused by the movement of the air 
across the edges of the open vocal folds and other surfaces of the vocal tract" 
(Ladefoged, 2005, p. 58). The noise is most intense in the 3 kHz level for Eng¬ 
lish, and is slightly lower for Arabic, around 2.7 kHz. 

Among the phonemes that do not exist in English are the Arabic 
(pharyngealized) emphatics /t/ and /Q/, for which the most common allophones 
are a voiceless unaspirated post-dental stop and a voiced interdental fricative. 
The emphatic N is the pharyngealized counterpart of /t/, but unlike l\I it is often 
articulated with the tongue tip positioned further back, is unaspirated, and has a 
shorter burst duration (20-30 ms vs. 40-60 ms) concentrated usually at a slightly 
lower frequency range (1500-2400 Hz) than that of /t/ (1600-2700 Hz). When 
next to /a/, the second formant values are approximately 1500-1550 Hz for /t/, 
but drop to the 1150-1250 Hz range for/t/. The emphatic/5/, on the other hand, 
is more similar to its non-pharyngealized counterpart /6/: Both are produced 
interdentally and have the same noise duration (100-160 ms) with similar first 
and third formant resonances at around 275 and 2350 Hz, respectively. Howev¬ 
er, the second formant noise resonances for /S/ are between 900-1000 Hz, 
compared to /6/'s 1500 Hz. A dampening effect is also observed for the 
pharyngealized fricative/5/, but not for/5/, evident in the second formant onset 
values for an adjacent /a/ (1150-1200 Hz vs. 1500-1600 Hz). 

The rest of the phonemes which do not exist in English consist of the 
gutturals, which are in fact entirely new to English. The uvular /q/ surfaces as a 
voiceless unaspirated stop acoustically resembling a typical stop accompanied 
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by a robust release burst spectrally visible around 3000 Hz followed by a peri¬ 
od of silence with a duration of 30-40 ms. The most common allophone of !yj 
is a voiceless uvular fricative which appears spectrally as random noise averag¬ 
ing 100-160 ms. When next to /a/, the fricative noise condenses around the 
1500 Hz baseline and is almost undetectable below 1000 Hz. A fronted velar- 
ized variant also exists when next to the high front vowels I\1 or/i:/. The frica¬ 
tive /y/ is the voiced counterpart of /yj and is realized either as uvular with /a/, 
/a:/, /u/ and /u:/, or as a fronted velar sound when next to the vowels /i/ and 
/i:/. This sound is similar to tyj in duration, but has its noise concentrated at a 
lower frequency range (1300 HZ). The pharyngeal /h/ is realized as a voiceless 
constricted fricative with an average duration of 100-150 ms. It is set off from 
/h/ by its somewhat constricted articulation which shows more intense noise 
often at lower third formant frequencies (Klatt & Stevens, 1969). The con¬ 
striction involved in producing /h/ is formed by bringing "the dorsum of the 
tongue against the posterior wall of the pharynx where the movements of the 
pharyngeal muscles play an important role" (Al-Ani, 1970, p. 60). When inter¬ 
vocalic, /h/, as /h/, tends to be voiced. In general, as Peterson and Shoup (as 
cited in Al-Ani, 1970) say, pharyngeal (and glottal) sounds are distinguished 
from other sounds by their "vertical" places of articulation (i.e., along the back 
of the throat from the palate to the glottis), as opposed to horizontal places of 
articulation which extend from the lips to the uvula. 

Testing PAM 

Determining L 2 Arabic Contrasts and their Assimilability to English 

Table 3 includes 9 Arabic consonant contrasts that were examined. 4 The 
pairs in the table are distinguished either by voicing (1-3), manner of articulation 
(4-6), or place of articulation (7-9). The manner contrasts are differentiated ei¬ 
ther by the feature [emphatic] (pharyngealized) as in 4-5 or [continuant] as in 6; 
the place contrasts differentiate the pharyngeal /h/ either from the glottal N as 
in 7 or from the uvular !yj as in 9, and the velar Ikl from the uvular /q/ as in 8. 


4 Initially some 20 pairs were tested, but since all involved the same contrasts tested here 
(i.e., the same place, manner or voicing), only a subset is reported on. 
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Table 3 Tested Arabic consonant contrasts 


No. 

Contrast 

Type of contrast 

Description 

1 

/t/-/d/ 

voicing 

voiceless vs. voiced 

2 

/0/-/8/ 

voicing 

voiceless vs. voiced 

3 

lyJ-Ni 

voicing 

voiceless vs. voiced 

4 

/t/-/t / 

manner 

emphatic vs. non-emphatic 

5 

I5/-/6/ 

manner 

emphatic vs. non-emphatic 

6 

/q/-/x/ 

manner 

plosive vs. continuant 

7 

/h/-/h / 

place 

pharyngeal vs. glottal 

8 

/k/-/q/ 

place 

velar vs. uvular 

9 

/x/-/h/ 

place 

uvular vs. pharyngeal 


Based on the previous discussion of the similarities and discrepancies be¬ 
tween Arabic and English in the realization of these sounds, I make the predictions 
shown in Table 4 with regard to the plausible assimilability of each Arabic conso¬ 
nant to its closest English counterpart. Given that no two sounds are exactly iden¬ 
tical in different languages (as is well known, but see e.g., Rochet, 1995), the as¬ 
similability shown in Table 4 is both approximate and predictive. It is hypothesized 
on the basis of the acoustic and articulatory phonetic subtleties between Arabic 
and English discussed in the previous section, as well as on the author's own ob¬ 
servations (teacher's observations) of AE speakers' substitution patterns when 
learning (perceiving and producing) these sounds of Arabic as a second language. 
In learning Arabic, the majority of English speakers show a clear and consistent 
tendency to replace the emphatics Itl and /6/ with their non-emphatic counter¬ 
parts, namely Itl and /5/. In addition, though less consistently, they substitute the 
velar Ikl for the uvular/q/, and the glottal /h/ for the pharyngeal /h/. 5 

Table 4 Assimilability of Arabic consonants to English 


Arabic 

Assimilable to English 

Goodness of fit 

70/ 

/e/ 

excellent exemplar 

/a/ 

/a/ 

excellent exemplar 

Ikl 

/k / 

excellent exemplar 

/h/ 

/h/ 

excellent exemplar 

/t / 

Itl 

good exemplar 

/d/ 

Id/ 

good exemplar 

Itl 

Itl 

poor exemplar 

/a/ 

/a/ 

poor exemplar 

/q/ 

Ik/ 

very poor exemplar 

/h/ 

/h/ 

very poor exemplar 

/%! 

/h/ 

extremely poor exemplar 

I'll 

Zg/ 

extremely poor exemplar 


5 Sometimes /g/ and /%/ are substituted for /q/ and /h/, respectively. 
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Perceptually, Arabic /t/ and /d/ are expected to represent good exemplars of 
English /t 1 and /d/, since in each language only slight discrepancies exist in the ca¬ 
nonical articulation of both. Still, the two sounds are predicted to assimilate to 
two separate corresponding categories in English. The sounds /0/, /a/, Ikl and /h/ 
are predicted to be perceived by AE listeners as excellent instances of their English 
equivalents (closest to being perceived as identical), as there are minimal to no 
differences between their realization in each language. The emphatics /t/ and /a/ 
are expected to sound close to non-emphatic It/ and /a/, just as the pharyngeal /h/ 
most likely resembles its unconstricted glottalized counterpart, M 6 

The Arabic voiceless uvular /q/ is predicted to be heard by AE listeners as a 
very poor exemplar of English Ikl. This prediction is based not only on the rela¬ 
tive acoustical similarity discussed earlier, but also on the fact that in the speech 
of many English learners of Arabic /q/ is fronted to Ikl, as mentioned above. 
Even in some non-standard Arabic dialects, the voiceless uvular /q/ is almost 
always substituted for the voiced velar /g/ as in Gulf Arabic (e.g., spoken in Saudi 
Arabia, Kuwait, UAE, Qatar, etc.). The uvular fricatives tyj and /y/, although in 
rare cases produced as/k/ and /g/ by beginner AE learners of Arabic, bear no 
clear resemblance to any particular category of the English phonological system 
and are deemed, therefore, perceptually non-assimilable to any particular cate¬ 
gory in English; nonetheless, the guttural gesture (which subsumes uvulars) 
does exist in English in some allophonic environments and must therefore be 
included within the native phonological space of English (Best, 1995). 

Stating the PAM Predictions 

Considering the Arabic contrasts stated in Table 3 and based on the pre¬ 
dicted assimilability of each contrast member to its closest English equivalent 
in Table 4, four assimilation types associated with PAM can be identified in the 
classification of these contrasts. 

The first assimilation type that can be discerned in Table 3 is the TC type 
in which both non-native phonemes assimilate to two separate categories in 
the native language. The Arabic voicing contrasts /t/-/d/ and /0/-/Q/ seem to fall 
well into this type, as every member in these contrasts is assimilable to a sin¬ 
gle English phoneme. In other words, the two contrasts exist both in Arabic 


6 Note also that to represent the emphatics and the pharyngeal, IPA uses the symbols 
/t/, /S/ and /h/ respectively, which are representationally isomorphic to non-emphatic 
It/, /6/and glottal /h/, perhaps suggestive of their phonetic closeness. 
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and English. The perceptual assimilation model predicts contrast discrimina¬ 
tion in this case to be highly efficient. The second deducible type is CG, accord¬ 
ing to which the two non-native sounds are perceived by the listener as in¬ 
stances of one native phoneme, but they vary in their goodness of fit: One is 
acceptable, the other is not. I consider the contrasts between emphatics and 
non-emphatics in /t/-/t/ and /S/-/6/ as examples of the CG assimilation type for 
which discriminability would be "moderate to very good," as stated in PAM. 

On the other hand, the pharyngeal-glottal /h/-/h/ and velar-uvular /k/-/q/ 
contrasts appear to fit the UC assimilation type. In either contrast, one mem¬ 
ber is clearly equatable with English: Arabic /h/ and /k/ are almost identical to 
English Ihl and /k/. The other members /h/ and /q/ fall within the English pho¬ 
nological space, yet lack clear correspondents, although anecdotal evidence 
suggests that English speaking learners of Arabic poorly realize them as 
/h/ and /k/. Accordingly, it is possible to treat the Ihl-Ihl and /k/-/q/ contrasts 
as another example of CG assimilation, with Arabic /h/ and /k/ being the ac¬ 
ceptable and /h/ and /q/ the deviant exemplars of English /h/ and /k/. 

However, given the less frequent phonemic substitution of /h/ and Ikl for 
Ihl and /q/ in the speech of American learners (cf. /t/-/6/ for /t/-/6/), I believe it is 
more fitting for these two pairs to be classified as UC rather than CG contrasts. 
This ambiguity in the categorization of IhJ-lhl and /k/-/q/ pairs is quite reminis¬ 
cent of the English /r/-/w/ contrast for Japanese speakers, categorized in Best 
and Strange (1992) as a CG type, but as a UC type by Guion et al. (2000). Need¬ 
less to say, the suggestion made in this study is more along the lines of Guion et 
al. As far as discriminability is concerned, both assimilation types make similar 
predictions associated with the contrasts: very good contrast discrimination in 
UC, and moderate to very good in CG. 

The last three types of contrasts in Table 3, namely the voicing contrast in 
h/J- /y/, the uvular versus pharyngeal in /%/■ Ihl and the plosive versus continuant 
in /q/- /%/, show a UU assimilation type. The contrast discriminability which PAM 
predicts for this type of assimilation depends on how well each segment is 
assimilable, if any, to the native language as well as on the phonetic closeness 
between the contrast members, and can range, therefore, from poor to very 
good. In each of these contrasts neither consonant clearly assimilates to a spe¬ 
cific category in the English language. This is especially true for the uvulars !yj 
and /y/, which do not correspond to any English sound. The sounds Ihl and /q/, 
on the other hand, can be equated with Ihl and Ikl, although not commonly, as 
mentioned before. Thus, based on their assimilability to the learner's native 
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language in this study, it is hypothesized that within the UU assimilation type 
the contrast lyj-lyl would be less discriminable than lyj-lh/ or loj-lyl. 

Table 5 sums up the contrasts tested in this study, their assimilation types, 
and the predicted discrimination level associated with each type. The perceptual 
assimilation model predicts that theTC contrasts/t/-/d/ and /0/-/Q/ will be better 
discriminated than any of the other contrasts in this study. Another prediction 
that is testable is the increased indiscriminability of /%/-/y/ relative to the other 
two UU contrasts, lyj-lhl and Iql-lyl. 

Table 5 PAM's predictions for the Arabic contrasts 


Contrast 

Assimilation type 

Predicted discriminability 

/t/-/d/ 

TC 

excellent 

/0/-/6Z 

TC 

excellent 

/t/-/t/ 

CG 

moderate to very good 

/a/-/a/ 

CG 

moderate to very good 

/h/-/h / 

UC 

very good 

/k/-/q/ 

UC 

very good 

ixi-iyi 

UU 

poor 

/x/-/h/ 

UU 

very good 

/q/-/%/ 

UU 

very good 


M ethod 

The relevant predictions of PAM stated above were tested in an auditory 
perception experiment carried out to assess AE learners' perception of the 
various Arabic phonemic contrasts, summarized in Table 3. A perception ex¬ 
periment, as opposed to a production one, is believed to provide a better 
measure of acquisition or learnability since it involves less conscious 
knowledge of what is being learned (Archibald, 1998; Larsen-Hall, 2004). 

Participants 

Twenty two American learners of Arabic participated in the perception 
experiment. The participants were all native speakers of American English and 
their average age was 22. They were year 1 students at a US college enrolled 
in the university's Arabic program at the time of the study. The majority of the 
participants had knowledge of a third language, mainly Spanish or French. It 
was hard to find participants who spoke English and Arabic only since most 
had been exposed to a third language in high school. Werker (1986) shows 
that trilinguals discriminate non-native contrasts no better than bilinguals do. 
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In other words, it is believed that knowledge of a third language does not sig¬ 
nificantly affect contrast discrimination. 

Participants who received any specialized training in pronunciation or 
phonetics and those whose length of residence in an Arabic-speaking country 
exceeded 6 months were excluded from the study. In addition, participants for 
whom Arabic was considered a heritage language as well as those reporting 
any hearing difficulties were precluded. 7 All subjects were compensated $10 
each for their participation in the experiment. According to self-report, no 
participant had any hearing difficulties. 

Materials and Task 

Initially 20 phonemic consonant contrasts were tested (see the Appen¬ 
dix). However, only those mentioned in Table 3 are reported on here. In this 
study, the term phoneme refers to the allophone that is used in isolation and 
is usually more common than other variants, and which Daniel Jones (1967, p. 
8 ) considers as the "principal member" or the "norm" of the phoneme. The 
pairing of phonemic contrasts in Table 3 depends on potentially confusable 
consonants that share major place-of-articulation features (coronal, dorsal, 
guttural). In addition to the nine contrasts in Table 3, four more contrasts 
were used as distracters: /m/-/n/, /r/-/l/, /w/-/j'/, and /CI3/-/JV. 

With an AXB discrimination task, four test items (AAB, ABB, BAA, BBA) 
were generated for each of the 24 contrasts yielding a total of 96 randomly 
ordered test trials. Every test trial was a triad consisting of three disyllabic 
nonsense words following the template ?a:'Ca: (where C is consonant), with 
stress being placed systematically on the second syllable, for example, 
?a:'ta:-?a:'ta:-?a: l ta:. Thus a total of 288 test words were used in the discrimi¬ 
nation task (96 x 3 =288). Compared to syllable onsets or codas, the intervo¬ 
calic position was chosen for its ideal environment in the perception of conso¬ 
nants, especially stops (Wright, 1995, p. 35). Also, word-final stop contrasts 
can be difficult for English listeners to detect due to the fact that stops tend to 
be unreleased in that position (Selkirk, 1982). The use of nonsense test tokens, 
as opposed to real words, was intended to minimize any effects word fre¬ 
quency and familiarity may have on L2 learners' discriminative ability. 

A native speaker of Arabic trained in linguistics (the present author) 
produced the test words, which were transcribed in IPA to ensure a more ac- 


7 Initially 24 participants were tested; however, 2 were excluded from the study: one for whom 
Iraqi Arabic turned out to be a heritage language (parents’ native language), and the other had 
stayed in an Arabic-speaking country (Egypt) for over a year and was married to an Arab. 
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curate pronunciation. The stimuli were digitally recorded in a sound treated 
lab using Audacity (Audacity Team, 2008) recording and editing software (ver¬ 
sion 1.2.4) and a clip-on PRO 7 Electret condenser microphone on a Windows 
Vista Dell 1420 laptop computer. 

Procedure 

An AXB forced-choice discrimination paradigm was used to elicit Ameri¬ 
can learners' perception of the different Arabic contrasts in this study. An AXB 
discrimination task provides a reference point (i.e., X) against which the simi¬ 
larity of stimuli can be gauged by listeners, as opposed to a simple AX discrim¬ 
ination task where listeners may base their same/different responses on non- 
linguistic factors (Beddor & Gottfried, 1995). 

In a quiet library room setting, aural stimuli were presented randomly 
over headphones (Koss R80) to each participant individually. Each test trial pro¬ 
ceeded as follows. The participant listened to all three tokens in each triad (e.g., 
?a:'ta:-?a:'ta:-?a: l ta:) and had to indicate on an answer sheet provided whether 
the first or third word was the same as the second. An inter-stimulus interval 
(ISI) of 1000 ms followed each token. Longer ISI is believed to encourage pho¬ 
nemic rather than phonetic perception of non-native contrasts (Werker & Lo¬ 
gan, 1985). A 3000 ms inter-trial interval separated each trial from the following 
one. The 96 test trials were administered over two sessions (48 trials each). In 
order to ensure that each subject understood the procedure, a 3-item practice 
test was administered to each participant prior to the experiment. For each 
participant, the experiment lasted an average of 20 min. 

Results 


By Contrast 

Discrimination scores from the AXB task were collapsed across all 22 
subjects and pooled for each of the 9 phonemic contrasts stated in Table 3. To 
determine whether differences in discriminability were significant among the 
contrasts, discrimination scores were submitted to a repeated measures (with¬ 
in subjects) ANOVA, which tested for the significance of the independent vari¬ 
able, that is, contrast (9 levels). The results indicated a significant effect of 
contrast, F(8,168) =63.80, p <.001, n P 2 =.75. To find out which pairwise com¬ 
parisons are significant, a series of Bonferroni post-hoc tests compared dis¬ 
crimination scores across all 9 levels of the contrast variable. Table 6 sums up 
discrimination success rates for each individual contrast as well as the 
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(in)significance of each pairwise comparison. The leftmost column presents 
percentages of correctly discriminated contrasts. This is depicted graphically in 
Figure 1. For instance, best discrimination involved the TC contrasts /t/-/d/ and 
/0/-/Q/, for which performance was native or near-native. On the contrary, 
poorest discriminability belonged to the UU contrast /x/-/y/- Table 6 also 
shows whether pairwise comparisons (second column vs. top row) among the 
different contrasts were significant or not. For example, while the difference 
between /t/-/d/ and /0/-/Q/, both TC types, was not significant, the difference 
between these and all other contrasts was. In other words, /t/-/d/ and 
/0/-/Q/ patterned similarly in being significantly more discriminable than any 
other contrast. With some exceptions, it appears that discriminability, to a 
large extent, varied as a function of the assimilation type. 


Table 6 Arabic contrast discrimination results by AE listeners 


Discriminability 

Contrast /0/-/3/ 

Itl-ltl 

/a/-/3/ 

/h/-/h/ 

/k/-/q/ 



loj-lyj 

100 

/t/-/d/ ns 

* 

* 

* 

* 

* 

* 

* 

98.9 

/G/-/3/ 

* 

* 

* 

* 

* 

* 

* 

64.8 

/t/-/t/ 


ns 

* 

ns 

* 

ns 

* 

72.7 

/3/-/5/ 



* 

ns 

* 

* 

ns 

37.5 

/h/-/h/ 




* 

ns 

* 

* 

64.8 

/k/-/q/ 





* 

ns 

* 

36.4 

IfJ-l'il 






* 

* 

54.5 

lyj-faj 







* 

82.9 

loj-tyj 









ns insignificant at the level of .05. 
* p <.05. 




Contrast 

Figure 1 AXB results in the discrimination of Arabic contrasts 
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By Assimilation Type 

Next, contrast discrimination rates were tallied up and averaged for each of 
the four assimilation types TC, CG, UC and UU, given the a priori classification in 
Table 5. A repeated measures ANOVA with the independent variable of assimila¬ 
tion (4 levels) revealed a significant main effect for assimilation, with F{ 3, 63) = 
76.59, p <.001, n P 2 =.78. Except for the pairing of UC and UU, which turned out to 
be insignificant, p >.05, post-hoc Bonferroni tests indicated significant differences 
among all pairwise comparisons with a high confidence level (p < .001). This is 
summarized in Table 7 and illustrated in Figure 2. Averaged discrimination rates 
show that TC contrasts were significantly more discriminable than other assimila¬ 
tion types. Similarly, CG contrasts had higher overall discriminability than UC or 
UU contrasts. Although the lowest discrimination rate was found in the UC type of 
assimilation, it did not differ significantly from that of UU. 


Table 7 Discrimination results of the Arabic contrasts by assimilation type 


Contrast 

Discriminability 

Assimilation type 

CG UC UU 

/t/-/d/, /0/-/6/ 

99.4 

TC 

* * * 

/t/-/t/, /5/-/a/ 

68.7 

CG 

* * 

/h/-/h/, /k/-/q/ 

51 

UC 

ns 

/xA/y/, /q/-/x/ 

57.9 

UU 



ns insignificant at the level of .05. 
* p <.05. 



1C CG lie 


Assimilation Type 

Figure 2 AE listeners'discriminability of Arabic contrasts according to assimilation type 
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Discussion 

A cursory look at Table 6 shows that AE listeners' ability to distinguish 
non-native contrasts varies significantly as a function of whether or not the 
two contrasted segments are assimilable to the learner's native language. 
With few exceptions to be discussed, contrast discrimination seems to gener¬ 
ally follow from the PAM predictions stated in Table 5, with regard to assimila¬ 
tion type and discriminability. 

/t/-/d/ 


The contrast between the voiceless /t/ and voiced /d/ dental stops was 
predicted to follow a TC assimilation type where each segment in the contrast 
is assimilated to a separate native category, resulting in excellent discrimina¬ 
bility. Looking at Table 6, this prediction is borne out; listeners discriminated 
correctly 100% of the time, and, as post-hoc tests show, their discrimination 
was significantly better than any other non-TC contrast. 

/0/-/d/ 

The voicing contrast between interdental /0/ and /5/ was also predicted 
to show TC assimilation. Results confirm this; subjects were able to discrimi¬ 
nate /0/ from 16/ accurately most of the time (98.9%). As in /t/-/d/, this result 
was significantly better than the rest of the contrasts. 

TC Assimilation 

As predicted by PAM, both Arabic /t/-/d/ and /0/-/Q/ voicing contrasts be¬ 
haved similarly in conforming to the TC. They both displayed the highest rate of 
discrimination and were significantly more distinguishable than other contrasts. 
The insignificant difference in discrimination between these two contrasts sug¬ 
gests that to American listeners /t/-/d/ was as distinctive as /0/-/5/. The success 
rate averaged for these two contrasts is 99.4%, which is significantly the great¬ 
est among other assimilation types. This pattern is in line with the findings of 
Best et al. (2001), for example, who demonstrated excellent (95% correct) TC 
contrast discrimination between Zulu voiceless-voiced fricatives / 4 /-/I 3 / and 
Tigrinya ejective bilabial-alveolar stops /p'/-/t'/ by native listeners of English. 
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Itl-ltl 


It was predicted that the Arabic emphatic contrast Itl-ltl would be as¬ 
similated as a CG type by native speakers of English where, depending on the 
goodness of fit, discriminability can range from moderate to very good. Results 
show that listeners had a somehow poor success rate in discriminating 
Itl from It/ (64.8%). In addition, discrimination here did not differ significantly 
from that of UC /k/-/q/ or UU h/J-lhl, perhaps due to the unpredictably poor 
performance of the latter as well, as will be discussed later. 

/<$/-/ 6 / 

In contrast to Itl-ltl, the distinction between the Arabic emphatic and 
non-emphatic interdental fricatives was perceived better. Listeners more suc¬ 
cessfully distinguished 16/ from 16/ 72.7% of the time, compared to 64.8% for 
Itl-ltl, although the difference between the two emphatic contrasts did not 
reach significance. However, discriminability for 161-/6/ was significantly better 
than other contrasts except for /k/-/q/, for which discrimination was, as men¬ 
tioned earlier, counter-predictably low, and loj-lyj, a UC type for which PAM 
makes a similar prediction of very good discrimination. 

CG Assimilation 

The pattern of perception for the /5/-/5/emphatic contrast appears to fit 
the discriminatory level hypothesized for CG assimilation. Listeners' diminished 
ability to differentiate Itl-ltl compared to /6/-/Q/, however, could have resulted 
from It/ being perceived as a better fit for Itl than 161 for 161. In other words, to 
listeners Itl was perceptually closer to Itl than 16/ to 161, leading to greater con- 
fusability in the pair /t/-/t/, and thus to lesser discriminability. When combined, 
discrimination averages 68.7% for the two CG contrasts. This rate is much lower 
than what is reported in Best et al. (2001) for English listeners who showed a 
much higher success rate (89.4%) in discriminating the Zulu CG voiceless aspi¬ 
rated and ejective velar stop contrast /k h /-/k'/. It seems, therefore, that the 
predictions of PAM concerning CG assimilation are partially supported by the 
emphatic contrast between the interdental fricatives 161 and 161. 
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/h/-/h/ 

It was hypothesized that the pharyngeal-glottal contrast would exempli¬ 
fy a UC assimilation for which discrimination should be very good. This predic¬ 
tion did not transpire, however, as AE listeners had great difficulty in distin¬ 
guishing these two sounds with only 37.5% of the contrasts being correctly 
discriminated. Except for ///-/y/, discrimination of /h/-/h/ was significantly 
worse than all other contrasts, particularly UC /k/-/q/. 

/k/-/q/ 

The velar-uvular contrast was also hypothesized to show very good dis¬ 
crimination, typical of UC assimilation. Although performance significantly 
improved on this contrast compared to the pharyngeal-glottal, still discrimina¬ 
tion was unpredictably poor hovering around the 64.8% range. 

UC Assimilation 

The manner in which AE listeners discriminated the pharyngeal-glottal 
/h/-/h/ as well as the velar-uvular /k/-/q/ contrasts did not conform to the PAM 
predictions for UC assimilation; that is, neither contrast showed very good dis¬ 
crimination although for /h/-/h/discriminability was poorer but considerably 
better for/k/-/q/. Further, discrimination of the two contrasts added up to 51%, 
which is rather poor and counter-predictive. It is possible that the phonetic 
proximity of the segments, as evidenced by the phonemic substitution of /h/ 
and /q/ for /h/ and /k/, respectively, has led to the poor discrimination of these 
UC contrasts. Guion et al. (2000), for example, report similar results for Japa¬ 
nese listeners, who, although successful in discriminating English /r/-/w/, were 
not able to differentiate the English UC contrast /s/-/0/ accurately. 

The voiceless-voiced uvular contrast lyj-l\i was predicted to be a UU 
assimilation, in which, according to PAM, discriminability can range from poor 
to very good depending on how well each segment is assimilated, if at all, to 
the native language, as well as on phonetic similarity between members of the 
contrast. Recall that within the UU contrasts examined /x/-/y/ was predicted 
to be the least discriminable. This was borne out; native AE listeners experi- 
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enced great difficulty in telling these two sounds apart. They successfully dis¬ 
criminated only 36.4% of the stimuli, the worst of all contrasts. 

/yj-/h/ 

PAM predicted the uvular-pharyngeal ///-/h/ contrast to be a well- 
discriminated UU assimilation, since at least one segment (i.e., /h/) is remotely 
assimilable to English /h/. Results show this was not the case, however, with 
AE listeners discriminating ///-/h/ poorly, just above the chance level (54. 5%). 

/q/-/%/ 

The uvular plosive-continuant contrast was predicted to be a UU assimi¬ 
lation as well. Because English /k/ can be representative, although poorly, of 
Arabic /q/, the contrast was expected to be differentiated rather well. This 
prediction was borne out. American listeners demonstrated very good, 82.9%, 
discrimination of the contrast. 

UU Assimilation 

Of the three UU contrasts investigated in this study, h/J-lyt and Iql-lyJ 
support the PAM predictions. Averaged discriminability across all three con¬ 
trasts is 57.9%, significantly lower than TC and CG assimilation types, but not 
UC, for which discriminability was even worse. As for CG contrasts, it is hard to 
assess the overall success rate of UU contrasts because of the range of vari¬ 
ance in discriminability set by PAM (moderate to very good in CG, poor to very 
good in UU). Therefore, it is best to examine each contrast individually, as 
done so far, in order to evaluate the relevant PAM predictions. The results for 
the UU assimilation type as a whole in this study corroborate those of Guion et 
al. (2000), whose Japanese subjects, consistent with UU assimilation, failed to 
perceive the distinction between English /r/ and III. 

In general, the data support the hypothesis that L2 Arabic contrasts are 
perceived by AE listeners within the confines of their native language phono¬ 
logical system. Their ability to discriminate the Arabic contrasts in this study 
relied crucially on how similar or different (assimilable or not) the contrastive 
sounds are to the listeners' native language. L2 Arabic contrasts involving 
sounds that are identical or similar to English tended to be perceptually much 
more discriminative as in /t/-/d/ and /B/-/5/. Other contrasts varied in their 
discriminability from poor (UC /h/-/h/, UU lyl-lyl and h/J-lhl) to good (CG 
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/t/-/t/, UC /k/-/q/) to very good (CG /6/-/5/, UU Iql-lyJ) as an apparent func¬ 
tion of their assimilability to the listener's native language as well as the pho¬ 
netic proximity of the contrasted segments to each other. 

Such effects of the native language sound system on contrast discrimi¬ 
nation and more generally speech perception have been well established. The 
loss of sensitivity to foreign contrasts has been attributed to interference from 
the ambient (native) language and thus, to use Kuhl's (1993) terminology, the 
once "citizen of the world" becomes a "culture-bound" perceiver. For adults, 
continued exposure to the native language, while rendering their auditory 
system more attuned, and thus confined, to the native language sounds, costs 
them the ability to detect minor phonetic differences that subsequently arise 
in learning a second language sound system. Strange (1995) summarizes the 
L2 perception experience as follows: 

Between early infancy and adulthood, then, children’s interactions with their linguis¬ 
tic environment while acquiring their first languages produce significant changes in 
the perception of speech sounds. There is a "loss" in the ability to differentiate pho¬ 
netic categories perceptually that are not phonologically distinctive in the native lan¬ 
guage, while native contrasts may become more highly differentiated, (p. 19) 

Table 8 sums up the findings of this study for each of the 9 Arabic con¬ 
trasts tested. It is clear that discrimination results for five of the contrasts in 
this study are in line with their relevant PAM predictions. The rest, however, 
contradict PAM, with discrimination in each case being less than expected, 
especially in the UC contrast /h/-/h/. 


Table 8 PAM predictions vs. AE listeners' discrimination of Arabic contrasts 


Contrast 

Assimilation type 

Predicted discriminability 

Attested discrimination level 

/t/-/d/ 

TC 

excellent 

100 

/e/-/a/ 

TC 

excellent 

98.9 

/t/-/t/ 

CG 

moderate to very good 

64.8* 

/5/-/5/ 

CG 

moderate to very good 

72.7 

/h/-/h/ 

UC 

very good 

37.5* 

/k/-/q/ 

UC 

very good 

64.8* 

W-N/ 

UU 

poor 

36.4 

/x/-/h/ 

UU 

very good 

54.5* 

/q/-/x/ 

UU 

very good 

82.9 

Note. Asterisks (*) mark discrimination contradicting PAM, 
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That the UU contrast loj-lyj was significantly better discriminated than 
lyl-lyl and lyj-lhl is not surprising if within-contrast confusability is taken into 
account, as recognized by PAM, although it is not entirely clear how this "prox¬ 
imity" or lack of it factors into the whole process of contrast discrimination. Pre¬ 
sumably, the less similar two segments in a new contrast are to each other, the 
less confusable they become making it easy for listeners to tell which is which. It 
is possible that non-native contrasts involving manner of articulation are more 
distinctive perceptually than place or voicing contrasts: Of the three UU con¬ 
trasts, Iql-lyl is the only one that distinguishes manner of articulation. 

Perhaps another reason why listeners were biased to poorly discrimi¬ 
nate lyj-lyl and lyj-lhl compared to Iql-lyJ is the orthographic similarity 
which the segments share. In Arabic, tyj is typically written as t (when isolat¬ 
ed) or a. (when connected), /y/ as t or A and /h/ as c or^. On the other 
hand, Arabic /q/ is written j or a Table 9 illustrates the orthographic differ¬ 
ences for these pairs. The first column shows the Arabic graphemes as letters 
written in isolation. The next three columns show how each of these graph¬ 
emes is transcribed in connected writing according to its position in the word. 
Phonemes in word initial and medial positions are orthographically more simi¬ 
lar to each other than in final position, which in turn is very similar to the pho¬ 
neme in isolation. The last column presents the IPA symbols often used to 
transcribe the phoneme. As can be seen in Table 9, the phonemic- 
orthographic relation between these graphemes is clear: The uvular fricatives 
tyj, /y/ and the pharyngeal /h/ all resemble each other orthographically in 
initial, medial, and final positions, whereas the distinctive style in which /q/ is 
written sets it apart from others. It is possible, therefore, that the orthograph¬ 
ic disparity between /q/ and hyj and lack of it in lyl-lyl and lyl-lhl may have 
played a role in how learners perceived the contrast in these pairs. 


Table 9 Orthographic representation of the Arabic phonemes: tyj, /y/, /h/and /q/ 


Isolated 


Connected 


i n a 

Initial 

M iddle 

Final 

IPA 

t 


— 

c- 

X 

i 

-C. 


Cr 

Y 

c 



£■ 

h 


J 

s. 


q 


Research has shown that orthography is not an unlikely factor in non¬ 
native acquisition. Atkey (2001), for example, maintains that the orthographic 
conventions of the Czech writing system influence the reading ability of early 
Czech learners. Others have reported effects of orthography on speech per- 
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ception (e.g., Dijkstra, Roelofs, & Fieuws, 1995; Taft & Hambly, 1985; Ziegler & 
Ferrand, 1998), and speech production (Lupker, 1982). 

As for CG /t/-/t/ and UC /h/-/h/ and /k/-/q/, each consisting of one conso¬ 
nant present in the listeners' native language phonology paired up with an¬ 
other new (unfamiliar) albeit distantly related consonant, lesser discriminabil- 
ity or confusability between the consonants may have arisen primarily due to 
the new consonant being well assimilated to an already familiarly known seg¬ 
ment. The pattern in which listeners incorrectly responded on the AXB task 
suggests that they significantly settled the contrast in each of these pairs in 
favor of /t/, /h/, /k/, respectively. That is, they overwhelmingly judged the 
English-like sound as the more similar one. 

This aural penchant for the native segment varied, however, from one 
contrast to another. It was strongest for the pharyngeal-glottal pair /h/-/h/, 
with /h/ being chosen almost 89% out of the 55 incorrect responses reported 
(49/55). Next was the emphatic /t/-/t/ with /t/ adding up to 77.4% (24/31), and 
last was the velar-uvular contrast for which /k/ was selected 64.5% of the time 
(20/31), As such, this bias may have contributed to AE listeners' unexpectedly 
poor performance in the discrimination of these contrasts. 

To conclude, findings of this study suggest the following discriminability 
hierarchy: TC >CG >UC, UU. The TC type contrasts significantly emerged as 
more highly discriminative than the CG, UC, or UU contrasts corroborating the 
TC >CG discriminability ranking introduced by PAM (Best, 1995) and reported 
in Best, McRoberts, and Goodell's (2001) examination of English listeners' per¬ 
ception of Zulu and Tigrinya consonant contrasts. The results of this study are 
also in line with the PAM predictions for the perception of nasal consonants in 
Malayam Marathi and Oriya by Malayalam, Marathi, Punjabi, Tamil, Oriya, 
Bengali and American English speakers (Harnsberger, 2001), non-native sylla¬ 
ble structure and voicing contrast perception by Japanese learners of English 
(Nagao, Lim, & de Jong, 2003), and perception of obstruent word-final voicing 
contrasts by M alay speakers of English (Pilus, 2002). 

Although PAM makes no explicit claims regarding the relative difficulty 
in the discriminability of CG, UC and UU types, which ranges from poor to 
moderate to very good, it states that discrimination for contrasts of the UC 
type should always be "very good." However, AE listeners in this study treated 
UC contrasts no better than CG or UU contrasts. In fact, there was new evi¬ 
dence suggesting better discrimination of CG contrasts compared to UC and 
UU types. Finally, the study has failed to maintain the distinction between UC 
and UU assimilation types, reported in Guion et al. (2000) for the discrimina¬ 
tion of English /r/-/w/ and /r/-/l/, respectively, by Japanese listeners. 
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Conclusion 

This small study tested the predictions of Best's (1995) PAM in the per¬ 
ception of Arabic consonant contrasts by American learners. With some ex¬ 
ceptions, the pattern in which the different Arabic contrasts examined in this 
study were discriminated followed straightforwardly from PAM: TC >CG >UC, 
UU. In general, excellent discrimination was significantly associated with TC 
contrasts. Interestingly, CG contrasts were also significantly more distinctive 
than UC or UU contrasts. No distinction between UC and UU types emerged, 
however; both were poorly discriminated by AE listeners. 

The perceptual assimilation model provides a useful tool in classifying 
non-native contrasts, and predicting the differential success non-native listeners 
achieve in discriminating them. The issue of assessing similarity between native 
and non-native phonemes remains a perennial one, not only to PAM but to any 
model of non-native speech acquisition. One measure of similarity that appears 
to be overlooked by PAM is the orthographic conventions of the target lan¬ 
guage. Evidence from this study suggests a role of orthography in the perception 
of L2 contrasts. Compared to Iql-h/J, the voiceless-voiced contrast in tyj-N 
and the uvular-pharyngeal contrast in ///-/h/ were highly indistinguishable to 
AE listeners presumably due to the orthographic similarity between these pho¬ 
nemes in the Arabic writing system. However, this remains a hypothesis that 
needs to be examined further. Given the implementation of orthography in the 
second language teaching curriculum and its importance as a factor in the per¬ 
ception of non-native phonemes, revisions to have it incorporated into models 
of second language perception ought to be made. 
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APPENDIX 

List of the Arabic contrasts used in the experiment 


Contrast 

Stimuli 

/t/-/d/ 

?a:'ta:-?a:'da: 

/0/-/S/ 

?a:'0a:-?a:'8a: 

/s/-/z/ 

?a:'sa:-?a:'za: 

/s/-/s/ 

?a:'sa:-?a:'sa: 

/t/-/t/ 

?a:'ta:-?a:'ta: 

/6/-/6/ 

?a:'8a:-?a: l 8a: 

/d/-/d/ 

?a:'da:-?a:'da: 

/h/-/h/ 

?a:'ha:-?a:'ha: 

/q/-/k/ 

?a:'qa:-?a:'ka: 

/q/-/?/ 

?a:'qa:-?a:'?a: 

/t/-/d/ 

?a:'ta:-?a:'da: 


?a:'xa:-?a:'Ya: 

/x/-/h/ 

?a:'xa:-?a:'ha: 

/y/-/f/ 

?a:'ya:-?a: l ?a: 

/h/-/?/ 

?a:'ha:-?a:'?a: 

/6/-/s/ 

?a:'8a:-?a:'sa: 

/x/-/?/ 

?a:'xa:-?a: l< i a: 

/y/-/h/ 

?a:'ya:-?a:'ha: 

/q/-/x/ 

?a:'qa:-?a:'xa: 

/q/-/y/ 

?a:'qa:-?a:'Ya: 

/ml-Ini 

?a:'ma:-?a:'na: 

It/ -III 

?a:'ra:-?a:'la: 

Iwl-ljl 

?a:'wa:-?a:'ja: 

/d 3 /-/J/ 

?a:'d 3 a:-?a:'Ja: 
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